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2 Fast detection of communication patterns in distributed executions 
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November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies on 
Collaborative research 

Full text available: ||[pdu421_.M3i Additional Information: fujLcitatjon, .abstract, references, index.terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based on process- 
time diagrams are often used to obtain a better understanding of the execution of the application. The 
visualization tool we use is Poet, an event tracer developed at the University of Waterloo. However, 
these diagrams are often very complex and do not provide the user with the desired overview of the 
application. In our experience, such tools display repeated occurrences of non-trivial commun ... 
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October 1976 ACM SIGARCH Computer Architecture News, volume 5 issue 4 
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August 1988 ACM Transactions on Computer Systems (TOCS), volume 6 issue 3 

Full text available: ^pd«3.4S MB) Additional Information: full citation , abstract , references , citings , index terms 

The Intel 432 is noteworthy as an architecture incorporating a large amount of functionality that most 
other systems perform by software. It has, in effect, "migrated" this functionality from the software into 
the microcode and hardware. The benefits of functional migration have recently been a subject of intense 
controversy, with critics claiming that a complex architecture is inherently less efficient than a simple 
architecture with good software support. This paper examines t ... 
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6 DSSE: a programmable macro engine for customizing applications 
Marc L. Corliss, E. Christopher Lewis, Amir Roth 

May 2003 ACM SIGARCH Computer Architecture News , Proceedings of the 30th annual 

international symposium on Computer architecture, volume 3i issue 2 
Full text available: ^pd:'C335,3G.KB) Additional Information: full. citation, abstract, references 

Dynamic Instruction Stream Editing (DISE) is a cooperative software-hardware scheme for 
efficiently adding customization functionality— e.g, safety/security checking, profiling, dynamic code 
decompression, and dynamic optimization— to an application. In DISE, application customization 
functions (ACFs) are formulated as rules for macro-expanding certain instructions into parameterized 
instruction sequences. The processor executes the rules on the fetched instructions, feeding the 
executi ... 
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Philip C. Treleaven, David R. Brownbridge, Richard P. Hopkins 
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June 1986 ACM SIGARCH Computer Architecture News , Proceedings of the 13th annual 

international symposium on Computer architecture, volume 14 issue 2 
Full text available: 'H pdf(945.42 KB) Additional Information: full citation, abstract , references, index terms 

The G-machine provides architecture support for the evaluation of functional programming languages by 
graph reduction. This paper describes an instruction fetch unit for such an architecture that provides a 
high throughput of instructions, low latency and adequate elasticity in the instruction pipeline. This 
performance is achieved by a hybrid instruction set and a decoupled RISC architecture. The hybrid 
instruction set consists of complex instructions that reflect the abstract architecture ... 

9 A^uantiMlve^evaluatlo 
D. Strube 

January 1989 Proceedings of the conference on Tri-Ada '89: Ada technology in context: 
application, development, and deployment 

Full text available: * g| pdfM.45 M3) Additional Information: full citation , abstract , references , index terms 

Projects considering the use of Ada for embedded weapon systems have performance as their primary 
concern. Despite the genuine interest throughout industry to commit to Ada for reasons of portability and 
maintainability, the language is still viewed with reservations when being considered for use in 
applications with high interrupt throughput or tasking requirements. There is increased demand from 
users for quantitative data on Ada's ability to handle deadlines in realtime. Such data can be ... 
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November 1980 Proceedings of the 13th annual workshop on Microprogramming 
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A user-microprogrammable computer has been developed for use as a building block in general-purpose 
and dedicated computer systems. The architecture is designed to be easily microprogrammed and 
features a 32-bit, vertically oriented microinstruction. The processor has a 135-nanosecond cycle time, 
either 16- or 20-bit macro data paths, and 1024 hardware registers. A significant fraction of the 
processor bandwidth may be budgeted for I/O processing to allow the substitution of microcode for e ... 
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12 Virtual machine monitors: Xen and the art of virtualization 

Paul Barham, Boris Dragovic, Keir Fraser, Steven Hand, Tim Harris, Alex Ho, Rolf Neugebauer, Ian Pratt, 
Andrew Warfield 

October 2003 Proceedings of the nineteenth ACM symposium on Operating systems principles 

Full text available: ^.pdf{158,75 K3j Additional Information: Ml citation, abstract, references, index terms 

Numerous systems have been designed which use virtualization to subdivide the ample resources of a 
modern computer. Some require specialized hardware, or cannot support commodity operating systems. 
Some target 100% binary compatibility at the expense of performance. Others sacrifice security or 
functionality for speed. Few offer resource isolation or performance guarantees; most provide only best- 
effort provisioning, risking denial of service.This paper presents Xen, an x86 virtual machine monit ... 

Keywords: hypervisors, paravirtualization, virtual machine monitors 



13 The design and development of a dynamic program behavior measurement tool for the Intel 

8086/38 

R. J. Schwartz 

June 1989 ACM SIGARCH Computer Architecture News, volume 17 issue 4 

Full text available: ^pdf(906.01 KB) Additional Information: fuii citation, abstract, index terms 

When modeling a computer system, it is necessary to study the system's dynamic behavior. Examples of 
this behavior are branching frequency and operating system usage patterns. The dynamic properties of 
the system characterize its performance. Models make assumptions about such behavior, but require 
genuine data to validate the assumptions. In this article, we present a measurement tool that will collect 
and analyze dynamic program information for the Intel 8086/88. The design and development of ... 
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Paul Racunas, Yale N. Patt 

June 2003 Proceedings of the 17th annual international conference on Supercomputing 

Full text available: ^ pd?n91.74 KB) Additional Information: full citation , abstract , references, index terms 

The high clock frequencies of modern superscalar processors make the wire delay incurred in moving 
data across the processor chip a significant concern. As frequencies continue to increase, it will become 
more difficult for a centralized first level data cache to supply the timely data bandwidth required by 
superscalar processors.This paper presents a complete solution for the partitioning of the first level of 
the memory hierarchy. The first level data cache is split into several independent pa ... 

Keywords: clustered microarchitecture, partitioned cache 
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This paper describes the BL addressing mode and the address tag in FLATS2 machine, which is a 
general-purpose MIMD computer now under construction. The BL addressing mode integrates memory 
accessing and range checking by hardware. Address tag is a bit in word, which indicates the capability 
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for memory access. Combining them together, efficient memory protection is provided at run-time. It 
reduces the cost of run-time type checking in Lisp by checking the address tag and the address of a 
poi ... 

16 Streamlining data cache access with fast address calculation 
Todd M. Austin, Dionisios N. Pnevmatikatos, Gurindar S. Sohi 

May 1995 ACM SIGARCH Computer Architecture News , Proceedings of the 22nd annual 
international symposium on Computer architecture, volume 23 issue 2 

Full text available: ^i>dftl,5S.M3i Additional Information: fu!|.cjtat|p.Q, abstract., references, citings, index tejnns 

For many programs, especially integer codes, untolerated load instruction latencies account for a 
significant portion of total execution time. In this paper, we present the design and evaluation of a fast 
address generation mechanism capable of eliminating the delays caused by effective address calculation 
for many loads and stores. Our approach works by predicting early in the pipeline (part of) the effective 
address of a memory access and using this predicted address to speculatively access the ... 

17 Pipeline Architecture 
C. V. Ramamoorthy, H. F. Li 

January 1977 ACM Computing Surveys (CSUR), volume 9 issue i 

Full text available: ^.pdf(3,53 MB) Additional Information: MLeMloa, references, citings, inde* terms 
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G. J. Lipovski 

April 1978 Proceedings of the 5th annual symposium on Computer architecture 

Full text available: ^ pdf(906 06 KB) Additional Information: full citation , abstract , references , citings , index terms 

A Context Addressed Segment Sequential Memory (CASSM) was built to evaluate several techniques for 
nonnumeric processing. Herein, it is described from the architectural point-of-view. The basic 
architecture is introduced, information, data and storage structures are discussed, and the principles of 
disc searching used by CASSM are discussed. 

19 Three simulator tools for teaching computer architecture: Little Man computer and RTLSim 
Cecile Yehezkel, William Yurcik, Murray Pearson, Dean Armstrong 

December 2001 Journal on Educational Resources in Computing (JERIC), volume i issue 4 
Full text available: ^pdfC8l32 J5.KB). Additional Information: full. citation, abstract, citings, index j^ms 

Teaching computer architecture (at any level) is not an easy task. To enhance learning, a critical mass of 
educators has begun using simulator visualizations of different computer architectures. Here we present 
three representative computer architecture simulators for learning which show that there is a growing 
consensus for computer simulation as a teaching tool for complex dynamic processes, such as underlying 
computer operations. Simulators also show the wide spectrum of pedagogical g ... 

Keywords: Computer architecture simulators, education 
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Microarchitecture 

Full text available: ~ ii| 
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Publisher Site 

Master/Slave Speculative Parallelization (MSSP) is an execution paradigm for improving the execution 
rate of sequential programs by parallelizing them speculatively for execution on a multiprocessor. In 
MSSP, one processor— the master— executes an approximate version of the program to compute 
selected values that the full program's execution is expected to compute. The master's results are 
checked by slave processors that execute the original program. This validation is parallelized by 
cutting ... 
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